Multi-Armed Bandit Mechanisms for Multi-Slot Sponsored Search Auctions

نویسندگان

  • Akash Das Sarma
  • Sujit Gujar
  • Y. Narahari
چکیده

In pay-per click sponsored search auctions which are currently extensively used by search engines, the auction for a keyword involves a certain number of advertisers (say k) competing for available slots (say m) to display their ads. This auction is typically conducted for a number of rounds (say T ). There are click probabilities μij associated with each agent-slot pairs. The goal of the search engine is to maximize social welfare of the advertisers, that is, the sum of values of the advertisers. The search engine does not know the true values advertisers have for a click to their respective ads and also does not know the click probabilities μijs. A key problem for the search engine therefore is to learn these click probabilities during the T rounds of the auction and also to ensure that the auction mechanism is truthful. Mechanisms for addressing such learning and incentives issues have recently been introduced and are aptly referred to as multi-armed-bandit (MAB) mechanisms. When m = 1, characterizations for truthful MAB mechanisms are available in the literature and it has been shown that the regret for such mechanisms will be O(T 2 3 ). In this paper, we seek to derive a characterization in the realistic but non-trivial general case when m > 1 and obtain several interesting results. Our contributions include: (1) When μijs are unconstrained , we prove that any truthful mechanism must satisfy strong pointwise monotonicity and show that the regret will be O(T ) for such mechanisms. (2) When the clicks on the ads follow a certain click precedence property , we show that weak pointwise monotonicity is necessary for MAB mechanisms to be truthful. (3) If the search engine has a certain coarse pre-estimate of μij values and wishes to update them during the course of the T rounds, we show that weak pointwise monotonicity and weakly separatedness are necessary and sufficient conditions for the MAB mechanisms to be truthful. (4) If the click probabilities are separable into agent specific and slot specific terms, we provide a characterization of MAB mechanisms that are truthful in expectation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Truthful multi-armed bandit mechanisms for multi-slot sponsored search auctions

In pay-per-click sponsored search auctions which are currently extensively used by search engines, the auction for a keyword involves a certain number of advertisers (say k) competing for available slots (say m) to display their advertisements (ads for short). A sponsored search auction for a keyword is typically conducted for a number of rounds (say T). There are click probabilities μij associ...

متن کامل

A novel ex-post truthful mechanism for multi-slot sponsored search auctions

In this paper, we advance the state-of-the-art in designing ex-post truthful multi-armed bandit (MAB) mechanisms for multi-slot sponsored search auctions (SSA) through two different contributions. First, we prove two important impossibility results which rule out the possibility of an expost monotone MAB allocation rule having sublinear regret with time when the click through rates (CTR) of the...

متن کامل

A Dominant Strategy Truthful, Deterministic Multi-Armed Bandit Mechanism with Logarithmic Regret

Stochastic multi-armed bandit (MAB) mechanisms are widely used in sponsored search auctions, crowdsourcing, online procurement, etc. Existing stochastic MAB mechanisms with a deterministic payment rule, proposed in the literature, necessarily suffer a regret of Ω(T ), where T is the number of time steps. This happens because the existing mechanisms consider the worst case scenario where the mea...

متن کامل

Showing Relevant Ads via Context Multi-Armed Bandits

We study context multi-armed bandit problems where the context comes from a metric space and the payoff satisfies a Lipschitz condition with respect to the metric. Abstractly, a context multi-armed bandit problem models a situation where, in a sequence of independent trials, an online algorithm chooses an action based on a given context (side information) from a set of possible actions so as to...

متن کامل

A truthful learning mechanism for multi-slot sponsored search auctions with externalities

In recent years, effective sponsored search auctions (SSAs) have been designed to incentivize advertisers (advs) to bid their truthful valuations and, at the same time, to assure both the advs and the auctioneer a non–negative utility. Nonetheless, when the click–through–rates (CTRs) of the advs are unknown to the auction, these mechanisms must be paired with a learning algorithm for the estima...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1001.1414  شماره 

صفحات  -

تاریخ انتشار 2010